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ASYMPTOTIC ERROR FOR THE MILSTEIN SCHEME FOR SDEs 
DRIVEN BY CONTINUOUS SEMIMARTINGALES 

By Liqing Yan 

University of Florida 

A Milstein-type scheme was proposed to improve the rate of con- 
vergence of its approximation of the solution to a stochastic differ- 
ential equation driven by a vector of continuous semimartingales. A 
necessary and sufficient condition was provided for this rate to be 1/n 
when the SDE is driven by a vector of continuous local martingales, or 
continuous semimartingales under an additional assumption on their 
finite variation part. The asymptotic behavior (weak convergence) of 
the normalized error processes was also studied. 

1. Introduction. We consider a general g-dimensional stochastic differ- 
ential equation (SDE) driven by a vector of continuous semimartingales 
y G M*^ on time interval [0, 1] with the starting point xq G K'^, 

(1) Xt = xo+ f' f{Xs)dYs, 

Jo 

where / denotes a matrix of functions from into (8> M*^. This equation 
includes the classical Ito-type SDE: 

(2) Xt = xo+ f a{Xs) dWs + /* h{X,) ds, 

Jo Jo 

with a, b matrices of functions and W a multidimensional Brownian motion. 
We refer to [10] for the stochastic integral with respect to a semimartin- 
gale. In applications, one wants to find the expectation of some functional 
of the solution of the SDE (1), for example, such quantities can be the prices 
of financial derivatives, such as options. Due to the simulation difficulties. 
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one usually combines a numerical solution X", with a Monte Carlo tech- 
nique to approximate the expectation. For example, one can use E[/i(Xp)] 
to approximate E[/i(Xi)] and use the Riemann summation 




Of cause, {E[/i(XJ*)], i = 1, . . . , n} are unknown, however, Monte Carlo method 
can be used to estimate them by simulating the paths of the numerical so- 
lution X". 

A widely used numerical method for the SDE (1) is the following continuous- 
type Euler scheme X", which is given by 

= ^n{t) + / i^n{t) ) O^t ~ ^n{t) ), Xq = Xq, 

where n{t) = k/n, \i k / n < t < {k + 1) / n. Many authors have studied the 
various convergence criteria for the Euler scheme under different cases of 
the matrix function /(•) and the driving process Y . For reviews, see [12] 
or [5]. Talay and Tubaro [13] obtained the celebrated expansion of the global 
error of the Euler scheme. Protter and Talay [11] studied the Euler scheme 
for SDEs driven by Levy processes. Kohatsu-Higa and Protter [6] studied 
the Euler scheme for SDEs driven by semimartingales. Yan [14] proved the 
convergence of Euler scheme for SDEs without continuity assumption of /(•)) 
and obtained the rate of convergence without Lipschitz conditions on /(•). 

For SDE (2), if a = 0, then the rate of convergence of the Euler scheme is 
classically 1/n; if a does not vanish, it is also classical that the rate is 1/^/n. 
However, the distribution of the normalized asymptotic error for the Euler 
scheme was established only recently, for SDE (2) see [7], and for SDE (1) 
see [2]. A necessary and sufficient condition in [2] (Theorem 1.2) was given 
for the rate of convergence of the Euler scheme to be 1/ y/n when the driving 
semimartingale is a continuous local martingale and / is a function, that 
is, y/n{X^ — X) converges weakly to a process U , which is a solution of a 
known linear SDE with some additional randomness. 

The rate of convergence of an algorithm certainly depends on the smooth- 
ness of /. If / is in C^, we can modify the Euler scheme to improve its rate 
of convergence and study its normalized asymptotic error. It is well known 
that (see Chapter 10 of [5]) the Milstein scheme for SDE (2) with the addi- 
tion of two more terms to the Euler scheme, which is given by (for the case 
of (i = g=l), XJ = and 

= + h{XZ^,)){t - n{t)) + l6(X;^(,))6'(X:(,))(t - n{t)f 

(3) +a(X^(,))(T^t-Ty„(,)) 

+ ia(x;^(,))a'(x;^(,))[(Tyf - w^^^.^f -{t- nm, 
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increases the rate of convergence from 1/n to 1/n? when a = 0, and increases 
the rate from l/y^ to 1/n when a does not vanish. In this paper, motivated 
by this fact, we give a class of SDE (1) that the Milstein scheme converges 
weakly to the solution X of SDE (1) at the rate of 1/n and determine its 
asymptotic error, that is, the weak limit of n(X" — Xj), when / is a matrix 
of functions. 

Our result is of mathematical interest only, since the Milstein scheme 
involves stochastic integrals which cannot be simulated exactly (except, in 
the Brownian case, when column vectors of the diffusion coefficients, seen as 
vector fields, commute in the Lie bracket sense). From an applied point of 
view, the discretization error needs to be studied in the weak sense (the law 
of the underlying process). It is now well established that errors in pathwise 
sense or in norm lead to crude sub-optimal estimates in the weak sense. 
For example, for the Euler scheme X" for the solution X, ^/n(X^ — X) 
converges weakly to a nonzero process (see [2]) and n{E[f{X'^)] — E[f{X)]) 
converges to a nonzero constant (see [13]). However, a recent work by Kbaier 
[4] uses the stable weak convergence of the normalized pathwise error in 
order to get a useful estimate in the weak sense for the Euler scheme for 
Brownian SDEs. Those technical results might help to provide estimates in 
the weak sense for schemes (such as the Milstein scheme), which discretize 
SDEs driven by general semimartingales and can be simulated. 

We refer to the section of preliminaries in [2] for the weak convergence 
in the Skorohod topology, which is denoted by See [8] and [9] for 

an expository account about weak convergence of stochastic integrals. For 
readers' convenience, we give a definition of the (*) property for a sequence 
of continuous semimartingales below. See [2] for the general case. 

Definition. We said that a sequence of continuous semimartingales 
X" satisfies (*), if [M",M"]i + is tight, where = + A^, 

is a local martingale and is a finite variation process, [M",M"']i is the 
quadratic variation of M" over [0, 1], is the total variation of A" over 

[0,1]. 

In this paper we only consider the continuous process Y. From [2] and 
this paper, especially Theorem 2.2 below, we can imagine that the error dis- 
tributions in the discrete case could be much more complicated. In Section 2 
we develop a fundamental result on the error distribution for the Milstein 
scheme for the SDEs driven by a vector of continuous semimartingales Y 
(not just one Brownian motion), where we employ Kurtz and Proffer's idea 
in [7], and overcome several technical difficulties for the Milstein scheme, 
a more complicated and accurate algorithm in high-dimensional cases than 
the Euler scheme. Various convergence rates are obtained for three cases: 
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(i) y is a continuous process of finite variation in Section 3, 

(ii) y is a continuous local martingale in Section 4 and 

(iii) y is a continuous semimartingale in Section 5. 

An example as the application of our theory is shown in Section 6 and seven 
lemmas are given in Section 7. 

2. Results on the error distribution. Let Y = (y*)i<i<rf be a vector of 
continuous semimartingales on a stochastic basis {Q,J-',J-t,P) with Yq = 
0. We consider the g-dimensional SDE (1) on time interval [0, 1] with the 
starting point xq = (xg, • • • , XqY £ '^'^ and / being a matrix of functions 
from into M"^ (g) M'' with at most linear growth [i.e., < c(l + 

for some constant c]. Then SDE (1) has a unique strong solution X, see [3]. 

There are many numerical methods to approximate the solution X , such 
as the Euler scheme. In this paper we consider the Milstein scheme X"' = 
...,X'"iy, which is defined by XJ} = xo, and 

xr = x:(\) + f(x;^(,))(y,-y.(,)) 

+ tr )VDf{x::^,^ ) (y, - y„(,)) dy;) , 

where tr(A) is the sum of the diagonal elements of matrix A, /* is the zth row 
vector of /, and DP = (f^^ := dp^ /dxk) is a q x d matrix, for i = 1, . . . ,q. 
This scheme X"* can be written as 

(4) Xr = xl + jj\Xl^^^)dY, + j\Y,-Y^^,^Y[f{Xl^^^)YDP{X^^^^)dY,. 

See (54) for an example of this Milstein scheme for SDE (2). The error pro- 
cess of the Milstein scheme is denoted by C/" = X" — Xt. We prove that, 
when / is a function and y is a continuous local martingale, nU"" con- 
verges weakly to a nonzero process U , which satisfies a known linear SDE 
(8). 

First we can use the same arguments as in Theorem 3.1 in [2] to show the 
convergence of the Milstein scheme X"^ to the unique solution X of SDE (1). 

Theorem 2.1. If f is a C'^ function with at most linear growth, then 
X" and -^"(.) go to X in probability. 

Before examining rates of convergence, we introduce some notation below. 
For a number < s < 1, let s^"'^ = s — n{s), where n{s) = k/n\ik/n<s< 



(fc + l)/n. 


For i 


(5) 




(6) 




(7) 
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The (j, /c) entries of the matrix M"* and A^"* are denoted by M"*-''"' and 
jymifc^ respectively. Let M" = (M"\ . . . , M"'^), and TV" = (7V"\ . . . , A^"'^). 

Theorem 2.2. Lei i//*-' be the Hessian matrix of and Un be a de- 
terministic sequence of positive numbers. There is equivalence between the 
following: 

(i) The sequence {anM^ ,anN'^) has (*) and 

{Y,anM^,anN^) =^ {Y,M,N). 

(ii) For any starting point xq and any function f with at most linear 
growth, the sequence OLnU"^ has (*) and {Y,anU^) ^ {Y,U). 

In this case, the limits M = (M\ . . . , M'^),N = {N^,..., N"^) andU = {U^ , . . . , 
W^y can be realized on the same extension of the space on which Y is de- 
fined, and they are connected by 

Ui= f{UsYDf\Xs)dYs 
Jo 

(8) - E E ( r flHXs)[Df''{Xs)Vf{Xs) dMi 

j=ik=i 

- lEtrf f[f{Xs)YHf'^{X,)f{Xs)dN^ 

and Uq = for i = 1, . . . ,q. 

Proof. First we estabUsh a connection between {U^,M"',N"') through 
a SDE (18). Let R'l = {Rf, . . .,RfY, where 

(9) RT = j\f{X^^s)Ws{Y)YDf\Xl^^^)dYs. 
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Let /i* = {DfYf by (5), we have 

(10) Rr = [h\x:^^))A^:{Y)j ' dv, = tr (|J h\x:^^)) dz: 

(11) A^(i?™) = tr(/i^(X^(,))A^(Z")). 

By the definition of the Milstein scheme in (4), 

(12) X^ = xo + |^*/(^^(.))dy. + iir, 
and therefore, 

(13) A^(X") = /(X„"(,))A:(y) + A^{R-). 

Integrating the transpose of both sides of (13) with respect to Dp{X^^^^) dYs, 
we have 

(14) j\AUX^^WDf{X::^^^)dY, = j\A^{R-WDriX::^^^)dY, + Rr, 

see (9) for the definition of i?™. From (12) and SDE (1), we get an expression 
for Uf = Xf - Xl 



(15) Ur = j\fiX2) - f{Xs)] - [fiX2) - r (^^(.))] dYs + R^ 



By Taylor's expansion, there exist between X" and Xg, and between 
X^ and . such that 

A n(s) 

(16) f{Xl^) - f{Xs) = {uiTDfiC), 

(17) f{X^) - nX^(s^) = [A^(X")]-Z)r (X^(,)) + \{Hf\ . . . ,i/™'^), 
where 

HT^ = [A^iX^WHf^iOA^iXn. 
Combining (15), (16) and (17), we have 

ur= f\u2yDr{c)dYs+Rr 

Jo 

- J\A:{x-)rDrix:^^^) dY^-iY. [ dYi. 

From (14), 

ur= f\KyDf\c)dYs 

Jo 



\A^iR-)VDf{X:^^^)dYs - i E dYi. 

.7 = 1 



THE MILSTEIN SCHEME 

By (11) and (6), 

d q 



j=ik=i ^-^o ^ 



The above equation about [/" can be written as follows: 

d q 



(18) 



-^Etr(^/J/7r(e)A^(x")[A:(x")r(iy/). 

Now we prove the implication (i) =^ (ii). First we assume that f,Df,Hf 
are all bounded. Since A"(y), A"(Z") and A^(i?") go to zero as n ^ oo 
[see (11)], by the definition of in (6) and Theorem 2.1, we have 

an A^iRn dYi = an tr(^J^ h^X^i^^^) dM^^^ =^ tr h'{Xs) dMi 
and 

an 1^ A-iR-) [A^iR^Y dY.Yan A^(y) [A^(ii")]- dY^^ =^ 0. 

From (13) and the definition of in (7), we have the following identity in 
the sense of weak limit (^=^): 



hm trfa„ / Fr(e)A?(X")[A^(X")]- dF/ 

(19) = \\^jT[an jj^{Xl^s))Hr{Cs)f{Xl(^^^^^^ 

= Jim tr (^a„ f {Xl^,))H P {C)f{X:^is)) dK^ 
= tr r(X,)i?/^^(X,)/(X,) diVi 



where (19) is due to the fact that iT{AB) = tT{BA) for matrix A and B. 
And now it follows that 



y, a„M«, a„ tr /^^(e) A^(X") [A^(X")]- dY^ 



i<i<q,i<j<d 
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(20) 



Y,M,tv(f r{Xs)Hf^{Xs)f{Xs)dNi)) 

\J0 / / l<i<q,l<j<d 

and the sequence in the left-hand side of (20) has {-k) since [unM'^ ^ anN"^) 
has (*). 

A similar argument (stopping time) as in the proof of part (a) of Theo- 
rem 3.2 in [2] shows that the weak convergence in (20) and (*) property are 
also true for any function / in (ii). Now the implication (i) ^ (ii) and the 
equation (8) follow from the above statement and (18) through the same 
arguments as in Theorem 3.2 of [2]. 

The implication (ii) ^ (i) is proved through (18) with the starting point 
xq = (1, . . . , 1) G M'' and some special matrix function /, which is defined 
below so that and A^" can be represented as integrals with respect to 
and Y. 

Let's consider M" first. When d = 1, we choose the SDE Xl = l + /g X] dY} , 
obviously, X^ =£{Y^), which does not vanish and X"^ does not vanish ei- 
ther on [0,1] for n large enough. Prom (18), we have dC/"^ = U^^ dY^ — 
X^^^^ dM^^^"^ , which is equivalent to 

dMf^^ = c/rv^„t) - V^nW dU^'- 

From Theorem 2.3 in [2] and Theorem 3.1, it follows that a„M" has (*) and 
(y,a„M")^(y,M). 

When d = 2, we consider the SDE Xl = l + a /g X] dY} + hY^ , where a, h 
are two constants. Equation (18) becomes 

(21) dUf = aU''^ dY^ - A:^(\) dUf^^ + h dM''^'^^ . 

When a = 1,6 = 0, let C/"i" = = A"^ When a = 1,6 = 1, let 

^nui ^ f/ni^ js;^ni2i ^ x^'K Equation (21) leads to two equations for M^^^S 

^nl21. 

(22) dM^^i^ = C/r''V^:S' dY,' - 1/X:jf dU-''\ 

(23) dM"i2i = -dU^^^^ + C/"i2i - X^Jf dM^^^\ 

For M"^^^M"^22^ consider the SDE in (1) with xq = (1, 1)^ and 

f- 



Then 



o Xl 

XI 
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and /f^ = 1, all other f^^ are zeros. Letting i = 2 in (18) we have 

(24) df/i"2 = [/'^i dYt^ - a dMf^'^ - dMi"i22 ^ 

When a = 0, = SiY"^), which does not vanish and X""^ does not van- 
ish either on [0,1] for n large enough. From (24), we can get an equation 
for M"122 ,^22) for M"i". When a = 1, from (24), we can get an 

equation for M"^^^ as in (23) for M"^^"^. Now we have four equations for 
M"-^ = (M"'^*-')2x2- Switching the positions of and Y"^ gives us four sim- 
ilar equations for M'"^. Again, from Theorem 2.3 in [2] and Theorem 3.1, it 
follows that a„M" has (*) and (y,a„,M") ^ (r,M). 

When d > 3, we choose three different fixed integers i, j, k between 1 and 
d, and define 

/"'^(xi, a;2, . . . ,Xd) = X2{hi + I2A:) + A/2i + Phj, 

where lij is a d x d matrix with its entry being 1 and other entries 

being zero, A and /3 can be either or 1. 

Denote X^{f) the ith component of the solution X of SDE (1) with the 
starting point xq = (1, . . . , 1) G W. Let X^''^ = X'^{f'^), X^^^ = X^{f'^), 
Xiik ^ x^if'^), X^i^ = X\f'^), X^^^ = X\(/0'i). Naturally, let X™^'' be 
the Milstein scheme for X'^'' and [/"^■''^ = X'^^i^ - X^^^ for any 1 < i, j, /c < d. 
Since X^^'^ = £{Y^), it does not vanish and X"^^^^ does not vanish ei- 
ther on [0,1] for n large enough. Letting i = 1 or 2 in (18), with / being 
jOfi ^ jifi ^ ^ respectively, we have 

(25) dUf^^ = Uf^^/X^l^ dYj' - 1 /X;^Jf d^7f"*^*^^ 

(26) dMl''''^ = Ul'^'^'^/Xllf^f dY; - 1/X^5f dUl''''^ 

(27) dMr'' = Uf'^ dYi - dUr^ - dMf''^ , 

(28) duf'^ = Uf^'' dYt - dUf^'' - X;;^' dMf''^ , 

(29) dM^^'' = Uf^^ dYi> - dUf^^ - Xf^^^ dMf^^ . 

For the error processes f/*^, under the assumption of (ii), we have that 
(y, a.C/™^■^ ^Jf )i<,,,fc<, =^ {Y, X^'')i<^,,,k<d, 

and the left-hand side of the above formula has (★). From Theorem 2.3 in 
[2] and Theorem 3.1, it follows that 

(30) OnM" has (*) and {Y, aU'',anW) =^ {Y, U, M). 

Next, let us consider A^". Let F{x) = J^e'^^/'^dy, for x,y e When 
d>3, we choose / as follows, for fixed k between 1 and d: 

f{xi,X2,...,Xd) = F{x2)F{x3)Iii + X2l2j + X-^hj ■ 
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Letting i = l in (18), we have 

rt d q 



d 

(31) 



I - II - illl. 



A simple calculation shows that Df^ = F'{x2)F{xs)l2i + F' {x^)F{x2)hi, 
Df = hj, Df = hk and Df = for / > 3; f^' = F'{x2)F{x^), fl' = 
F'{xs)F{x2) and //* = for I / 2 and / / 3, and all other Hessian matrices 
are zeros, except 

Hf'ix) = F"{X2)F{X3)I22 + F'(x2)F'(x3)(/23 + ^32) + F{x2)F" {x^)l33. 

(32) 

Therefore, 

II = tr fl\DfyfdMT^ + tr (^^* fl\DfYfdMT^ , 

III = tr Hf^\Cs)^:{x-)[^:{x-)Y dy;) . 

Since {DfYf = Ij2f = X^^l^Ijj, {DpYf = hsf = and tr(/,-,M™) = 

Mmn^ let Ff = F' {Xf^^^)F{Xf^^^)Xf^^^ and Ff = F'(X-f,))F(X-J))X-3 ), 
then 

(33) '\ Y" 

= I FfdMf^^+ I FfdMf^^. 
Jo Jo 

On the other hand. III can be split into two terms by (13), that is, 111 = 



Af + Bf, 




(34) Af = 


/ [/(^:{.))n^"^]^^/'^(e)[/(x„"(,))yi")] dv: 


(35) 


trj\f{X:^^^)YHf\a[f{X:^s))]dNi, 


(36) Br = 


J\a^{R-) + 2/(X„"(,))y i")]-F/i^(e)A^(i2") dv: 


(37) 


J\a^{R-) + 2f{X:^^^)Yj-^YHf\C) tiihYX:^^^) dMT 
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(38) =: f^HTtT{h\Xl^^^)dM^ 



s /I 



where (37) is due to /q A^(i?™) dV, = tr(/o /i*(^"(,)) dMf), which is from (11), 
and (35) is due to the definition of A^"* in (7); we write 

HT = [A^(i?") + 2/(x;^(,))yW]-i?/i^(C). 

To simphfy the notation, we can write (32) as Hf^^{^g) = 0/22 + ^(-^23 + 
732) + CI33, where a,b,c are corresponding function values at Using the 
fact that lijlki = Sjkhu where 5jk = 1 when j = k, Sjk = when j 7^ k; to 
simphfy the matrix product f^Hf^^f in (35), we get 

AT = tr (I + X:l^X:l^b{I,k + hj) + {Xll^fchu dK 

= (Xf^,^ fa dNf" + 2 j'^ X^l^X^l^ b dN"^'^^ + / ^^"^ ' ^ " "'"''^ 



n{s)) (^dNl 



where we use the fact that N"^^ is a sysmetric matrix. Now let and ^"^ 
be the second and third component of Q , respectively, and let 



= {Xl(^^Ya={Xr^,)rF"{ir)F{ir); 
Gf = {Xll.fc = {XlUfF"{if)F{if) 



then ^™ can be represented as integrals with respect to N™ as follows: 

(39) Af = f Gf dNf^^ + /* Gf dNf^^ + /* Gf dN""'^^ . 
Jo Jo Jo 

Letting F^^ = {U^YDf^{Q), combining (31), (33), (38) and (39), we get 
an equation for U"" ,Y, M"" , N"" , 

Uf = f Ff dYs - t Gf dNf^^ - f Gf dNf^^ - t Gf dN""'^^ 



- / FfdMf^^- I FfdMf^^- I Hfti{h'{X'\s)dMl 
Jo Jo Jo ^ ' 

When j = k, the above equation can be written as 

Nf" = ['{Ff dVs - dUf 
Jo 

(41) - (Ff + Ff) dMT" - HT tr(/i^(X«(,)) dMT)} 

X {Gf + Gf + Gf}-^. 
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Since X"^ = £(Y'^) and = £(Y^) do not vanish, and X" converges, G"* 
also converges and does not vanish for large n for i = 1,2,3. Under (ii) in 
Theorem 2.2, H^'^,F^'^ and converge to finite processes. By Theorem 2.3 
in [2] and Theorem 3.1, it follows from (30) that 



(a„?7",a„M",aniV™^^)i<i.,<d has (*) and 

(42) 

(y, a„M", a„iV"^'^^)i<i,,<rf {Y, U, M, A^*^^)l<^,,<d. 

Prom (40) we can express N^^^ as the sum of integrals with respect to Y, 
jjn^j^njymjj g^j^j pjmkk ^ gy game arguments, it follows from (42) that 



we can get (43) accordingly. Now, the implication (i) (ii) has been proved. 
□ 

3. Continuous processes with finite variation. When the driving process 
Y in (1) is of finite variation and continuous, SDE (1) is an "w-wise" (de- 
terministric) equation. Without loss of generosity, we assume that y is a 
nondecreasing function of t in this section. It is classical that the rate of 
convergence of the Milstein scheme for an ordinary differential equations 
(ODE) is l/'n?. Here we give a necessary and sufficient condition for the 
rate being l/ii? and its error distribution, which are useful for our main 
results in the next section when the driving process Y in (1) is a continuous 
local martingale. 

To simplify our notation, first we assume that y G and let 




When d = l, let f{xi) =F{xi); when d = 2, take 




and 




(44) 




An integration by parts shows that 



[nt] 



3ivr(y) = ^(y./n - i(.-i)/n)' + {Yt - y„w)' 



(45) 



n 




i=l 
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Theorem 3.1. Assume that Y is a continuous process in M} with finite 
variation. There is equivalence between the following: 

(a) Equation (46) holds almost surely; 

(46) Yt= [ Vsds, [ \ys\^ (Is <QO. 

Jo Jo 

(b) The sequence of random variables {4"'A'^^"}„>i is tight. 

(c) The sequence of random variables {n^A^f }„>i is tight. 

(d) sup„ n^A'"f < oo a.s. 

(e) The process n^N'^ converges a.s. uniformly in time to a process N . 
In this case we have Nt = \ JoUsds and sup„ n^/Q^ I'^A'^J'I < oo a.s. 

Proof. The implication of (e) (d) =^ (c) =^ (b) is obvious. Let A 
be the set of all u> such that (46) does not hold, and B = {u>: — > 
oo}. By Lemma 3.1, we have A C B and then (b) implies that P{A) = 0. 
So (a) follows. By Lemma 7.2, (a) implies (e) and Nt = ^/py^ds. Since 

\dN^\ < J^{Ys - %is)? dYs, where Y = Jo \ys\ ds, and Y satisfies (46), by 
Lemma 3.1, we have sup„n /o |diV,"| <(X) a.s. □ 

Lemma 3.1. Assume that Y is continuous in with finite variation 
and deterministic. Only two cases are possible: 

(a) Equation (46) holds and sup„ n^A^" < oo. 

(b) Equation (46) does not hold and 4"'A/"f" —>■ oo. 

Proof. We can use the same notation and arguments as in the proof of 
Lemma 4.2 in [2]. A modification is that (4.3) in [2] is replaced by n^Nf = 
J{Lnf dX = J{LnYVndu, where Ln{s) = ra(yj/„ - Y(j_i)/„) for s £ ((i-l)/n, 
i/n] (by the way, in (4.3) of [2] fi' should be replaced by i^). □ 

Now consider the case that Y G M^. Recalling the notation of N'''^'' and 
j^nijk ^g-j g^j^j ^y-j^ have the following theorem. 

Theorem 3.2. Assume that Y = iY"^ , . . . ,Y'^Y is a continuous process 
in with finite variation. For each i = 1, . . . ,d, there is equivalence between 
the following: 

(a) We have 

(47) y/= / yids, [ \yifds<oo a.s. 

Jo Jo 

(b) The sequence of random variables {4"'A'^^"***}.„>i is tight. 
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(c) The sequence of random variables {n^A^™**}„>i is tight. 

(d) sup„ n^A''" ™ < oo a.s. 

(e) The processes n^A^"*** converges a.s. uniformly in time to a process 

// the above cases hold for all i = 1, . . . ,d, then for any i,j,k = l,...,d, then 
the processes n^A^"*-'*^ and n^M^^^^ converge a.s. uniformly in time t to 
processes N"^^^ and K'P^^ , respectively, where 

Nl^' = \fy^,yiyUs and u't = 1 f y'sViyUs. 
Jo Jo 



And we also have sup^n^/J^ \ dN^'^^'^\ < oo and sup^n'^ Jq |fiM"*-'*^| < oo a. 



s. 



Proof. By Theorem 3.1, we have that (e) ^ (d) ^ (c) (b) =^ (a). 
By Lemma 7.2, we have (a) ^ (e), and the uniform convergence in the last 
claim. From (59) and Lemma 7.1, it follows that 



Jo Jo 



s 

\ 

r 



yj")^' dY} 

n{s) 

1 / rs rs 



yl\ds 



f ([ \yt\dr [ \y^\dr]\yi\ds <oo, 

Jo \Jn(s) Jn(s) ) 



'0 \Jn(s) J n(s) 

.2 / I jArmjA;! ^ / / |„,.7|J^ / |„,fc| 



\dN^'i''\<n' / Wr\dr |y^| dr|y^| < oo. 

Jo Jo Jn{s) Jn(s) 

So we have the uniform boundedness in the last claim. □ 

Recalling the notation that C/"^ = XJ^^ — , X"^ is the first component 
of the Milstein scheme X"^ defined in (4), and Xl is the first component of 
the solution X of SDE (1): Xt = XQ + /q f{Xs) dY^. Let 

G'^ix) = Hf'i{X,)[f{Xs)Y + fi^{x)[Df\x)Y. 

k=l 



Theorem 3.3. When Y = {Y^,...,Y'^Y is a continuous process with 
finite variation, there is equivalence between the following: 

(a) For any starting point xq G and any function f with linear 
growth, the sequence of random variables n'^ Jq \dU^\ is bounded a.s. 

(b) For any starting point xq and all functions f with at most linear 
growth, the process n'^U'^ converges uniformly to a limit U a.s. 
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(c) For each i, the process has the form 

Yl = I y\ds with [ |y*|^ds<oo. 
Jo Jo 

In this case, let ys = {yl, . ■ . , y^Y , then the limiting process U = ([/^, . . . , V^Y 
is the solution of the following linear equation: Uq = 0, and 

Ul= j\usYDf\Xs)ysds-\Y. f y:G'^{Xs)f{Xs)ysds. 
Jo ~{ Jo 

Proof. That (b) (a) is obvious. By Theorems 2.2 and 3.2, we have 
that (c) =^ (b) and the last claim. The implication of (a) =^ (c) can be 
proved by the notation and results in the proof of (ii) =^ (i) in Theorem 2.2. 
First, since X'^^'^^ uniformly converges to X^^^ = e^'' (by Theorem 2.1) 
and Y^ is continuous on [0,1], for fixed u ^Q., there exist two constants 
a and b such that < a < X^^^^ < b for n large enough and all < t < 1, 
k = l,...,d. Now from (25)-(29), n'^ \dM^^^^\ is bounded a.s. Similarly, 
from (41), n'^ Jq \dN™^^\ is bounded a.s. (c) follows from Theorem 3.2. □ 

4. Continuous local martingales. To simplify our notation in the proof 
of our main result, we first assume that the driving process Y in SDE (1) 
is a continuous local martingale in M}, null at zero. With the notation of 
Nf^iY) in (44), we have the following: 

Theorem 4.1. Assume that Y is a continuous local martingale in "M} 
null at zero. Let Ct = be the quadratic variation process ofY. There 

is equivalence between the following: 

(a) There is a predictable process c such that 

(48) Ct= I Cgds, [ \cs\^ds <oo a.s. for < t < 1. 
Jo Jo 

(b) The sequence of random variables {nNi{Y),n > 1} is tight. 

(c) The sequence of random variables {n'^[N"'{Y), N'"'{Y)]i;n > 1} is tight. 

(d) The sequence of random variables {n^iV"(C),n> 1} is tight. 

(e) sup„>;^ n^iVf (C) < oo, a.s. 

Proof. The equivalence between (a), (d) and (e) can be deduced from 
Theorem 3.1. The equivalence between (b) and (c) comes from the fact 
that, for a sequence of continuous local martingales M", {Mf ,n > 1} is 
tight if and only if {[M^,M^]i,n > 1} is tight. This is because there are 
two universal constants ai and 02 such that aiE([M", M"]^) < (EMp)^ < 
a2E([M", M"]"^) for any stopping time T. (a) ^ (b) is obvious after we 
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prove the weak convergence of nN™^^ in Theorem 4.2 under the assumption 
of (a), see the last paragraph of the proof of Theorem 4.2. 

Next we show that (c) (d). Let r(n, p) = inf{t : n2[iV"(y), iV"(y)]t > 
p], and r(p) =inf{t:Ct >p}, T = T{p) AT{n,p), and n = T A j^. Note that 
for a continuous local martingales Mt null at zero, by the Ito formula, = 
6jQM^dMs + 15 jQMfd[M,M]s. If [M,M]i is bounded, then J^M^dMs is 
a martingale, hence, EMf = 15E /g d[M, M]s. By the Burkholder-Gundy 
inequality, there is a constant k such that E[M, M]f < kKM^. Therefore, 

E[M,M]I <15kE t M^d[M,M]s. 
Jo 

Now let Mt = Y- y^r.^i . Since [M, M]t = C- Ct^n^, , 
Recalling the notation of (45), 

n 

EAr,V(C)=Ei^(C7-aA.,_j' 

i=l 
rtAT 

< 5kE {Ys - y(,_i)/„)^dC, = 5KE[N^{Y),N'''{Y)]tr.T. 

Let t = 1 in the above inequality, then E[n^iV"^y(C)] < hnp. Now for any 
?>0, 

P(n2iVf (C) > g) < P(n2iVf^^(C7) > g) + P(r < 1) < 5Kp/g + P{T < 1). 
Since P(T < 1) goes to uniformly in n as p — > oo by (c), we have 

lim snp P(n'^N^(C) >q)=0, 

which implies (d). □ 

Now we assume that the driving process Y = (Y^, . . . , Y'^Y in SDE (1) is 
a continuous d-dimensional local martingale on {Q,J^,J^t,P), null at zero. 
Let Ct = (Cj*"')i<ij<rf be the quadratic variation processes of Y, that is, 
C^"* = [y*,y-']t. Our main result is as follows. 

Theorem 4.2. Assume that Y is a continuous local martingale in R'^ 
null at zero. There is equivalence between the following: 

(a) There is a dx d nonnegative matrix-valued predictable process c such 
that 

(49) Ct= [ Csds, f \\csfds<oo forO<t<l. 

Jo Jo 
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(b) For each i, the sequence of random variables (r.v.) {nN™^^,n > 1} is 
tight. 

(c) For each i, the sequence of r.v. {n^[A^"'™, A^"™]i, n > 1} is tight. 

In this case, and if (Jt is a d x m matrix-valued process such that ct = 
o'tO'J (a'^ is the transpose of a), then there is a Brownian motion W = 
{W^ , . . . ,W'^'-Y on {n,T,Tt,P), a possible enlarge space of {n,T,J^t, P) , 
such that Yf = JgasdWg. The sequence {nM^^ ,nN^^)i^j^d stably converges 
in law to processes {M^ ,N^)i<j<d, given by 

(50) Ml = ^Y. ^sai^dB^al, 

(51) ^/ = ^E / '^sai^dVral, 

where = {Bf'^)i<ij<rn, Vf = {Vl"^)i<ij<m and for l<p,i,j < m, 

yP'^ = V2/2BP'^ + V2/2BP^' + {V3/2WP + l/2WP)l{i = j), 

where l{i = j) = 1 if i = j, otherwise, and {B^^^ ,WP\1 < p,i,j < m} is 
a sequence of independent standard Brownian motions on an extension of 
space {Cl,T,Tt, P) on which Y and W are defined and independent ofY and 
W. Moreover, we have that {nM'^,nN"') has (★) and {Y,nM"' ,nN"') =^ 
{Y,M,N), that is, the statement (i) in Theorem 2.2. 

Remark 4.1. The limits of the quadratic variation of {nM'^,nN^,W) 
are listed in Lemma 7.6. 

Proof of Theorem 4.2. By Theorem 4.1, we have the equivalence 
between (b) and (c). Next we show that (b) ^ (a). Since Ct — is a non- 
negative matrix for any t > s > 0, [Cl^ - C^Jf < [Cf - Cf][Cf' - Cf ], the 
Cauchy inequality implies that 

[Nl'{C'^)f < NJ'{C'')NJ'{C^^). 

By the notation of iVf ™ in (7) and iVf (^0 in (44), we have A^f ™ = Np{Y'). 
From the implication of (b) (e) in Theorem 4.1, sup„ n^A'^"(C**) < oo for 
all i. Therefore, sup„ n^A'^f (C*-') < oo for all i and j. By Theorem 3.1, we 
conclude (a). 

It remains to prove that (a) implies the last claims, since (b) or (c) follows 
from the last claims. By Lemma 7.6 and Theorem 4.1 of [1], the sequence 
(nM^P ,nN"-P)i<p<d stably converges in law to the process {MP , NP)i^p<fi 
of (50) and (51). By the same arguments as [2], it follows that the triple 
(y, nM"-P, nN''^P)i<p<d converges in law to (Y, M^, iV*')i<p<d for the product 
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topology on ©(M*^) x ©(M^'^^), and since all these processes are continuous, 
we also have convergence for the Skorohod topology on D(R'^"''^'^ ). Finally, 
Lemma 7.6 implies (★) for the sequence (nM"', nA^"). □ 

Theorem 4.3. For a continuous local martingale Y = {Y^ , . . . ,Y'^Y 
with quadratic variation Ct = \Y,Y]f, there is equivalence between the fol- 
lowing: 

(a) For any starting point xq £ and any function f with linear 
growth, the sequence of random variables nU"^* is tight, where X* means 
that supo<f<i \Xt\. 

(b) For any starting point xq and any functions f with at most linear 
growth, the sequence {Y,nU"') converges weakly to limit {Y,U). 

(c) We have (49). 

In this case, the process U = {U^, . . . , W^Y is the solution of the linear equa- 
tion (8), with M,N given in (50) and (51). 

Proof. That (b) =^ (a) is obvious. From Theorems 2.2 and 4.2, we have 
that (c) (b) and the last claim. The implication of (a) (c) can be proved 
by the notation and results in the proof of (ii) (i) in Theorem 2.2. First, 
from (25)-(29) and the tightness of V^* and C/'"^*, that is, the tightness of 
jjnijk^ we deduce that uM'^'^^^ has (*). From (41), the sequence uN"'^^^ is 
tight and satisfies (*). Then (c) follows from Theorem 4.2. □ 

5. Continuous semimartingales. Now we consider the case that the driv- 
ing process 1" is a continuous semimartingale. We assume that Y = H + A, 
where ^ is a continuous adapted process of finite variation and H is a 
continuous local martingale, both being d-dimensional and null at 0. Let 
Ct = {Cl'^)i<i j<d be the quadratic variation process of Y, that is, C^-^ = 
[Y\Y^t = [H\W]t. 

We do not have the necessary and sufficient conditions for the error pro- 
cess [/" to converge at the rate 1/n. Partial results are available when At has 
the form 

(52) ^t = i o.\ds with / |a*p(is < oo a.s. 

Jo Jo 



Theorem 5.1. Assuming that Y is a continuous semimartingale, for 
the following statements, we have that (a) and (52) imply (h), and (b) 
implies (a). 

(a) There is a dx d nonnegative matrix-valued predictable process c such 
that 

(53) Ct= Cgds with I \\cs\\^ds<oo a.s. 

Jo Jo 
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(b) The sequence (nM^^ ,nN^^)i<p<rn, in (6) and (7), has (*) and is 
tight. 

And furthermore, under (a) and (52), if Ht = fQCTsdWs, where ct = atcrl 
as in Theorem 4.2, then the sequence {nM^^ ,nN'^^)i<p<d stably converges 
in law to the processes {M^ , NP)i<p<d, where N[ = Nf + 1/2 /q c,af ds, and 
M,N are given in (50) and (51). Moreover, we have 

i<p<d ho-s {-k) and 

(y, nM"^ niV"P)i<p<rf =^ (y, AF, NP)i<p<d. 

Remark 5.1. The limits of the quadratic variation [uM'^ ,nN^ ,W) are 
hsted in Lemma 7.6. 

Proof of Theorem 5.1. By the integration by parts formula, we have 

(yH)(yW)-=r yW(dy,)-+ r dy,(y,("))" + cj"). 

J n{s) Jn(s) 

According to the notation of Af"^ and A^"^ in (6) and (7), we have 

Jo 

By Theorem 2.3(a) in [2], under (b), we see that the sequence n /q ci"^ dY^ 
is also tight and has (★) for any p = l,2,...,d. Particularly, the sequence 

n J^ci^^^^dYf is tight and has (*). Then its quadratic variation processes, 
n loiCF - CnU^^ dCPP = n'^NfiCPP), are tight too. Theorem 3.2 implies 

that sup„ n^Af (C^P) < oo. By the same reason as in the proof of (b) (a) 
in Theorem 4.2, we can conclude (a). 

Now we assume that (a) and (52) hold. By the usual stopping time argu- 
ments, we can also assume that /q ||as|p(is and /q ||cs|p are bounded by a 
constant a. 

First, uN^P can be decomposed as follows: 

nN'^P = F^P + F^P + F^P + + (Fg"^ + F^)" + F^p, 

where = nj^ /fi")(i/i"))- dRP, F^p = nj^ Ht\H'f'^Y dAf , FJ^p = nx 
/o* i^i") ( Ai" V dHP, = n /o* ifi") (yli") )- d^P, Fg"^ = n ^1"^ (^li") Y dYf. 
Since, for any i, n(^^("))2 = n(4 - A^^^^^f < ^(^^'/"(a^j^ < a, by (52) 

and the Cauchy-Schwarz inequality, n(^s^"^)^ goes to zero uniformly for 
s £ [0,1] as n — > 0. Thus, F^p has (★) and weakly converges to 0. By (81) 
and (82), F^^ and F^^ have {-k) and weakly converge to 0. By (80) and The- 
orem 4.2, F^P + has (*) and weakly converges to Nf = Nf + ^ /q c^af ds. 
Putting them together, nN^P has (★) and weakly converges to Nf . 
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Second, nM^^ can be decomposed similarly as follows: 

M"P= f GldHP+ f GldAl+ f G'idYi+ f G'ldY^ + f' G^dY^, 
Jo Jo Jo Jo Jo 

where = //("^ dH^, G^ = //^^ dA;, G^ = Ai^^ dH; , G^ = 

/n%)^^"^^^r- When H and A are a one-dimensional process, from 

(84) and (85), we have n J^G^ dYf 0; from (81) and (82), we have 

n /o* dYP 0; since + = , we have n G^ dYP 

0. Therefore, n £G^ dYP + n £ G^ dYP has (,^) and weakly converges to 0. 
Since n(^s^"^)^ goes to zero uniformly for s G [0, 1] as n — > oo as shown 
above, so does nG^. Thus, jQG^dYP has (*) and weakly converges to 0. 
By (83), n£G1dAP has (★) and weakly converges to 0. By Theorem 4.2, 
we have n j^GidYP has (*) and weakly converges to . So, nM'^P has (★) 
and weakly converges to . □ 

Combining Theorems 5.1 and 2.2, we get the rate of convergence 1/n for 
U"^ = X" — X, when 1" is a continuous semimartingale. 

Theorem 5.2. Assume that Y = H + A is a continuous semimartingale 
in Mf^ null at zero. If (52) and (53) hold, then for any starting point xq and 
any functions f with at most linear growth, the sequence {Y,nU^) con- 
verges weakly to a limit (Y, U), the process U = {U^, . . . , V^Y is the solution 
of the linear equation (8), with M given in (50) and N replaced by N , which 
is given in Theorem 5.1. 

6. Examples. For the Ito-type SDE in (2), Xt = xq + J^a{Xs)dWs + 
/g b{Xs) ds. If a, b are G^ functions, then we can use the Milstein scheme X" 
defined in (4) to approximate the solution Xt, 

= + aWt'' + bt(^^ + iaa'((T^("))^ - *(")) 

+ i66'(i("))2 + / a6's(") dW, + [ aft'wi") ds, 

Jn{t) Jn(t) 

where VF/"^ = Wt — W^it) > = t~ 'n{t), and function a, a', b, b' taking values 
at Xn{t) - Compared with the Milstein scheme defined in (3), this scheme has 

two more terms, /^^^-j ab's^"^^ dWs and /^^^^ a6'wi"^ ds. 

In order to apply Theorem 5.1 to find the asymptotic error U of nU"" = 
n(X"' — X) for the scheme in (54), we use the following notation: / = 
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(a, 6), Yt = {Wuty, Ht = iWt,Oy, At = (0,*)^ Let = yi")(yi"V, Gs = 

and 



By Theorem 5.1, [nN 










JO 




Mj"i = ^ 


fCsdWs, 






= f Gsds. 
Jo 








nlll-j _ 


^(^111^ ^211^ ^^111 


), where 


M"i = 


"7! 




2^/3 


\wt, 



and B^,B^,W are three independent standard Brownian motions, defined 
on an extension of the probability space on which W is defined. We also 
have that the limits of all other entries of nN"-^,nN"''^,nM"'^,nM"''^ are 
zeros. Therefore, the limit process U of — X) satisfies the following 

linear SDE: 

Ut = J' Usa{Xs)dWs + Usb'{Xs) ds-^f a^{Xs)b"{Xs) ds 

If a(-) is a function and b{-) is a function, we can use the following 
simpler Milstein scheme X" rather than (54): 

= xo + b{X:^^^)ds + a(X„"(,)) dWs + a(X„"(,))a'(X„"(,))Tyi-) dW^ 

to achieve the same rate of convergence, that is, n(X" — X) =^ [/, and U 
satisfies 

C/t = /* Usb'iXs) ds + U,a'{Xs) dWs -If' b{Xs)b'{Xs) ds 
(55) ^° ^ ^ 

- ^ ^* Co(X,) dt^, - ^ / C-il^^) - ^ C2{Xs) dBl 

where W, B^,B^ are three independent Brownian motions and Cq, Ci, C2 are 
three functions dependent on a and b. 
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7. Some lemmas. In the next two lemmas we assume that X,Y and Z 
are three continuous processes with finite variations, which satisfy 



(56) 



Xt= [ Xs ds, Yt= [ Us ds, Zt= [ Zs ds, 
Jo Jo Jo 



^,y,zGL3([0,l]), 



where LP{[0, 1]) = {x\ \\x\\ = {Jq \xs\Pdx)^/P < oo}. 



(57) 
(58) 



(n) 

Lemma 7.1. Let Xs = Xs — Xn(s), then for any <t<l, 

' < lk||3||?/||3||2;||3, 



n 



n 

10 Jn(s) 



Xi") dYr dZ, 



Proof. By the Holder inequahty, < (s-?T-(s))^/^(g) \xr\^ dr. Then 

||X(")||^< [\s-n{s)f r \xr\^drds<ir\\x\\l. 







n(s) 



It follows thatn2|/o*xfVrMZ,| <n2||X(")||3||y(")||3||z||3< ||x||3||y||3||z||3, 
which proves (57). Since 



(59) 



Xi") dYr 



ln{s) 

(58) follows. □ 



< 



n{s) Jn(r) 



Xu\du\ys\ds < \xu\du \yr\dr, 



n{s) 



n{s) 



Lemma 7.2. As n ^ oo, the following uniform convergences in t holds: 



(60) 
(61) 
(62) 



n2 / dZs i 



ds, 



n 



f X^J^^Y^'UZs ^ U XsVsZsds, 
Jo Jo 

f r Xl^^^dYrdZs \ f XsVsZsds. 

Jo Jn(s) Jo 



Remark 7.1. When x,z £ L^([0, 1]), from Theorem 4.1 in [2], we have 

(63) n / X^"^^ dZs — > i / XsZs ds uniformly in t as n — > oo. 
JO Jo 

When II X II 2 and ||z||2 are bounded, we have convergence in (63). 
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Remark 7.2. By Cauchy's inequality, we have the following, which will 
be used in the proof of Lemma 7.7: 

(64) n r\X^"'^\ds< \xs\ds and \X^'^^\^ ds < T \xs\'^ ds. 

Jo Jo Jo Jo 

Proof of Lemma 7.2. To prove (60), first we assume that Xg £ C^, 
therefore, Xg and dxg/ds are bounded by a constant a in [0,1]. By the 
mean value theorem, there exists an s between n{s) and s such that xi""^ = 
Xs{s — n(s)). Since "n? /q (s — n(s))^ ds t/3, and 



n 



{xsf{s - n{s)) Zgds 



{xsfzs ds, 



we need only to show that n'^ Jq{{xs)'^ — {xs)'^)(s — n{s)Y Zgds 0. This is 
true because [(xg)^ — (xj)^| <2a^/n and /q (s — n(s))^Zs — > ^ /q ds. 
Second, if Xs^ C^, since Xs £ L"^, for any e > 0, there exist a Xg £ such 
that ||x — xllg < e. By (57), 



n 



n{s) 



Xr dr I dZ, 



n 



Xr dr I dZ,^ 

n{s) 



n 



rt rs rs 

/ / {Xr + Xr)dr / {xr — Xr)drdZ, 
Jo Jn{s) Jn{s) 



Now we can claim that (60) holds for all x G . Since 2ab = {a + by 



(61) follows by (60). (62) can be proved by the same arguments as in the 
proof of (60). Here we just give two key steps. It is easy to see that (62) holds 
when Xg^Vs £ ■, since there exist s and f between n(s) and s such that 

/n(s) ^r"^^ dYr = XsVf !n{s) Jn(r) = XsVfis - n{s)f /2. And for x,y,x,y, 
by (58), 



n 



Xu du Ur dr dZs 



n 



J n{s) J n{r) 



Xu du Hr dr dZg 



13 + ll^/lblF ^IIsIfIIs- 



'0 Jn(s) Jn{r) 

< Ikllsliy-yibii^ii 

If Xs^Vs and Zg are all nonnegative, both sides of (60), (61) and (62) are 
all continuous and nondecr easing, these convergences are uniform in t over 
[0,1]. This uniformity can be easily generalized for all X,Y and Z which 
satisfy (56). □ 

In the proof of the next two lemmas, let Cp be the constant in Burkholder's 

inequality, that is, E|yt|^ < ^^^^^[y, y]j^^ for any continuous local martin- 
gale Y . 
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Lemma 7.3. Let W, B, U and V be four independent standard Brownian 
motions and n{s) = [ns]/n. If we write Wg^^ = Ws — VFn(s); then for <t < 



(a) 


n2 


Jo 




(b) 


n2 


Jo 


W^""^ B'f^ B'i''^ ds^t/3 


(c) 


n2 


Jo 




(d) 


n2 


Jo 




(e) 




Jo 





Proof. We prove the lemma by the simple fact that a sequence of ran- 

dom variables — ^ the limit of if the variance of ^n, Var(^„), con- 

verges to zero as n— >oo. Now for any four standard Brownian 
motions ^W"^ (not necessary to be independent), let ^i"^ = 
^i{n)^2{n)^3(n)^4(n). Cauchy's inequality, we have E(^i"^)2 < 

E(VFi"^)8 <C8[s- n{s)]^. Therefore, 



Var (^n^ ds^ < n^E ^ 



l/n X 2 

^ ds 



l/n 



l/n 



< Csn^ / sUs< Cs/n, 
Jo 



which implies that the variances of the left-hand sides of (a), (b), (c), (d) 
and (e) converge to zero as n — > oo. Next we calculate the expectations of 
the left-hand sides. Since the expectations of the left-hand sides of (c), (d) 
and (e) are and 

En^ l\w^''^fds = n^ f\{s - [ns]/nf ds 
Jo Jo 

= [nt]/n + {nt - [nt\)/n^ — > t, 



En^ / (Wi")5("))2(is = n2 



(s — [ns]/n)^ ds — > t/3. 



we can conclude all of the I? convergences. □ 
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Lemma 7.4. Suppose that for any pair from four standard Brownian 
motions W,B,U and V, they are either identical or independent, then for 
0<t<l, 

rt 



(a) /*( /' W^^'UBr r Uj^^'UVrlds 

Jo [Jn(s) Jn(s) ) 



'0 KJn{s) Jn{s) 

L\ \ t/6, ifW = U andB = V, 
\ 0, otherwise; 

( t/3, ifW = B = U = V, 

L 0, otherwise. 

Proof. We use the same arguments as in the proof of Lemma 7.3 to 
prove the convergence. Let 

Xs= r W.^^'^dBr and = T d^. 

Jn{s) Jn{s) 

Fust, for the variance of the left-hand side of (a), we have 

\Jn{s) ) 

< C74(s - n(s))E r [VFj")]^dr 

J n(s) 

= 6*4(5 — n{s))^. 
Similarly, E,{Ys)'^ < C4(s — n{s))'^. By the Cauchy inequality, 

rl/n 

<n^ {E{XsfE{Ysff^^ds 
Jo 

<nM C4{s-nis)yds = C4/{57i)^0, 





as n ^ 00. For the expectation of the left-hand side of (a), we use the in- 
tegration by parts formula (see page 60 of [10]). If B and V are indepen- 
dent, then E{XsYs) = E[X,Y]s = 0.lfB = V, then E{X,Y,) = E[X,Y]s = 

^In{s) W^i"''C^r"-' dr. Now if W and U are independent, then E(wi"Vi"^) = 

0; if = U, then E(wJ"^?7i"^) = r - n{r). Therefore, ii B = V and W = U, 



rt rt rs 

lim En^ / XsYsds= lim / (r — n(r))drds 

Jo ^0 Jn{s) 
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= lim nV2 / (s - 7i(s)f ds = t/6; 

otherwise, it is zero. We finish the proof of (a). Second, we claim that the 
variance of the left-hand side in (b) goes to zero as oo. This is because 



ln{s) 

JO Jo 



10 JO 

l/n 



< VQC^n^ / (s - n{s)y ds < VC^/(5n). 







Next, we prove the convergence of the expectation of the left-hand side of 
(b) to the right-hand side of (b). For the first case that W = B = U = V , 
since = 2/„*(^) 1^^"^ dWr + s - n{s), 

eL^ Tcwi"))^ r W^''UWrds}=2n^ TeI T W^'''> dWrV ds 

I JO Jn(s) J JO [Jn{s) ) 

i-t 

= n^ {s — n{s)) ds^t/3 







as n — > oo. 



For the second case that W = U ^ B = V, since W and B are indepen- 
dent, M^i^^si"^ = /^(^) lyj"^ dBr + -B^"^ (iVF,.; taking expectations of the 

squares of both sides, we have E{/^(g) VfJ"^ /^(j,) -B^"''' dWr} = 0. There- 
fore, 

E|n2 /* VFi")5(") dS^ ds\=r? Ce( r W^''^ dBr] ^ t^s 

I Jo Jn(s) J Jo \Jn{s) / 

71^/2 l\s-n{s)f ds^t/Q 

as n — > oo. 







The third case is equivalent to the second. It is easy to see that the expec- 
tations of both sides in the forth case are zeros. □ 



Now, let Y be the continuous local martingale defined as the beginning 
of Section 4, that is, Yt = Yt{a) = j^asdWa- In order to prove Theorem 4.2 
in the general case of cr, we define another continuous local martingale Yi, 
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on the probability space where Y and W live on, as follows: Yj = Yt{a) 
jQasdWs, where a is a d x m matrix- valued process. We write 



(65) 



(66) 



(7 



t rs 



Mr{<y)=l I Y,^^\a)dY/{a)dYi{a), 

Jo Jn{s) 

r] = i]{a,a) = (e \\as - ds^ 
where \\a\\ = (X]ji [^s"'P)^^^- ^t^^i^) ^'^t^i^) ^re defined similarly for Yt{a) 



Lemma 7.5. If Jq \\a\\^ ds and Jq \\a\\^ ds have an upper bound p, th 
there exists a constant C, which depends on p, such that 

n2E||[iV"f(a),iV"^(a)]t - [7V"^'(a), iV"''(a)]t || < Cr/, 

n2E||[M"P(a),M"''(a)]j - [KP^ {a) , M'^'^ {a)]t\\ < Cr,, 

n^E\\[N^P{a),M^^{a)]t - [N^^ (a) , M^'^ {a)]t\\ < Crj, 

n^E\\[N^Pia),W]t-[N^Pia),W]t\\<Crj, 



en 



(67) 
(68) 
(69) 
(70) 
(71) 



n 



'E||[M"P(a),VF]t - [M"P(a),VF]t|| < Cr,. 



Proof. Let C be the constant which changes from line to line. By the 
Burkholder and the Holder inequalities, for an integer k, 



(72) 



E(||yW||)2'=<c7(s_^(s))'="iE 



n(s) 



thus, /o E(||yi"^||)2*^ ds < C/n'' and also /g E(||yi"^||)2'= ds <C/n''. Similarly, 

/■*E(||yi") -Fi")||)2*^ds<C/n'=E f War - arf'' dr. 
Jo Jo 

Let A<Si B = (aijB) be the Kronecker product between two matrices A 



(aij) and B = {bum)- For (67), we write A = yi")(yi"))^ and B = Y^'iY, 
then by the Holder inequality. 



■-r?E 



Pa® AcP\a) -B® BcP\a) ds 
Jo 



<n^E / ll^i^^f |k-a||(||a|| + ll^ll 
Jo 
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+ 6rW-yi")||(||yi")f+||yWf)||af 

/ l-t _ \ 1/6 / /•< _ \ 1/2 

<Cn2^ E||yi")-yi«)f dsj ^ E(||yWf + ||yi")f)2d.j 

/ /-i \ 2/3 

For (68), we write A = y^^"^ d(y^)^ and B = again, by 

the Burkholder and the Holder inequalities and (72), 



< C 



E 



E 



n{s) 



\ 3/2n 2 



n(s) 



n{s) 



1/2' 



<Ce( r llyWfdrWfr Ik^fdr^ 

\Jn(s) / \Jn[s) J 



((s-n(s)) / [r-n(r)]2E 


I \\a uW*^ du 




V Jn{s) 


Jn{r) 





X E 



n{s) 



< C{s-n{s)r(K 



\ar\\ dr 



n{s) 



It follows that 



(73) 



E 



Similarly, 

E\\A-Bf < CE 



'ds<C f {s-n{s)fE f \\ar\ 

Jo Un[s) 

< 1/n^E Wasfds. 
Jo 



J n(s) 



'dr 



ds 



<CE ||y/") -y,(")f + 

J n{s) 



2|| - l|2 I 

(Tr — (TrW dr 



<c([s-n{s)fE r \\ar-afdrEf' \\arf + \\arfdr 

\ Jn{s) J n{s) 



1/2 
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By the Cauchy inequality, 







^E\\A-Bfds] <C f\s-n{s)fE I \\ar-afdrds 



(74) 





X /*E 

Jo Jn{s) 



ln(s) 

\o'r\\'^ + \\ar\\^ dr ds 



Jo 



Finally, (68) follows from (73), (74) and 

n^E||[M"^'(c7),M"''((j)]t - [M"P((j),M"«(a)]i 



f A® Acf^ia) -B® Bd"i{a) ds 
Jo 



<r?E 

<T?C 



E 



\a--a\\m^¥\\)^\\A-BmA\\^m)¥fds 

, \2/3 

1 r\ 

/ rt \ 1/2 f ,t \ 1/3 

+ n^cU E\\A-Bfds\ U E{\\A\\ + \\B\\f ds\ . 

Inequalities (69), (70) and (71) can be proved in the same way as above. □ 

Lemma 7.6. Let iV"P^J(cr) and M"P*-'(cr) be the entries of N^'Pia) 
and M'^P{a), respectively. Let be the ath component of the Brownian 
motion W iriM'^. Under the assumption of (49), for all 1 <i,j,k,l,p,q<d 
and 1 < a < m, and t G [0, 1], we have 



(75) 


n2[iV"P'^J'(CT),iV"«'^'(a)]t 


p 


3 


/ {^s ~^ '^s ~^ ^s^s )c^'^ ds 
10 


(76) 


n2[Ar"TO(cj),M"'?'='(CT)]t 


p 


6 


1 (''s '^s "1" '^s )'^s^ ds, 
10 


(77) 


n2[M"™((j),M"«'^'(CT)]t 


p 


6 


1 '^s ds, 

10 


(78) 




p 


2 


fci^afds, 

10 


(79) 


n[M"™((j),M^'^]t 


p 


0. 





Proof. Let Tp = inf{t : /q \\csfds > p). Since (49) implies that P{Tp < 
1) — > 0, it is enough to prove the lemma for the processes stopped at Tp, 
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which is equivalent to assuming that Jq \\cs\\^ ds is bounded by a constant p, 
or Jq \\(7s\\^ ds < p. Since ag E -^^^([0, 1]), for any e > 0, there exists a matrix 
function dg G C^([0,1]) such that \\as — as\\^ds < e, and ||cJs||^ds <p. 
If we denote the right-hand side of (75) as h{a), and denote it as h{a) when 
Cs is replaced by (TsctJ, then \h{a) — h{a)\ < \\as — as\\^ ds. We have the 
same results for the right-hand sides of (76), (77), (78) and (79). Therefore, 
by Lemma 7.5, it suffices to prove the lemma for as G C^([0, 1]). 

Now we define ag = (7n(s), where n{s) = [ns]/n. Since /g ||cr<j — a<j||^ds < 
K,/n for some constant k, by Lemma 7.5 again, we need only to show that 

[N^P'^{a),N^'i'^'{a)]t 1 f\d^cf + cfc^j + 4'c^?)c?'? ds, 

^ 

and the corresponding convergences in (76), (77), (78) and (79) where o is 
replaced by a. By the monotone class theorem and Lemmas 7.3 and 7.4, 

(a) n2[iV"P^i(a),iV"«'='(a)]t 

= ^ Taf afVf 'af cf (a)W'i")^Vi")^ Vi")'=Vi")'^ 
i'yk'v 

^ \ [\4^cf + 4V,' + 4'c','=)cf ds, 

Jo 

(b) n2[iV"W(a),M"«'='(fT)]j 
f cjfVf^^'af (a)VFi")*Vi")j' W^'''^^' dW'^ ds 

1,11.1,1 Jo Jn(s) 



i'j'k'l' 
P. 1 



(c^ 4+0^4 )^'^ds, 



(c) ?i2[M"P^^'(a),M"'?'='(^)] 



E 

i'j'k'l' 



t 

n I erf Gg Og cf[a) 



Jo 

(d) n[N^P^^ia),W% = J2n [ 

~77l Jo 



X / TyJ")*' dw^' / dwl' ds 

J n(s) J n{s) 



f'd^afds, 
Jo 

t 



2 



(e) n[M''P'^{a),W% = y n f af a^^' af Wj")^' dWf ds-^0 

Jt, Jo Jn{s) 



I'] 

We finish the proof. □ 
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Lemma 7.7. Let X^,X'^ be two continuous local martingales with their 
quadratic variation process Cl^ = [X^ ,X^]t = /q ds, and Xt,^^^ = XI — X^^^-^ . 



Let At and At be two adapted continuous processes of finite variation, with 
At = jQOsds and At = ^QOLsds. If fQ{as)^ds, jQ{ds)'^ds and Jq WcsW^ ds are 
bounded by a constant a, where ct = {ct)2x2, then 

(80) n /* dAs -^k t «s ds, 

Jo Jo 

(81) n t dXl ^ 0, 

Jo 

(82) n rxjW^")^^^ ^0, 

Jo 

(83) n /* r dX^ dAs 0, 

Jo Jn{s) 

(84) n /* A^.'^) dX^ dXl ^ 0, 

JO Jri{s) 

(85) n /* 4"^ dXl dAs ^ 0. 

JO Jn{s) 

Proof. The Burkholder inequality and the Cauchy inequality are used 
frequently throughout the proof, and the constant k changes from line to line. 
Let yi") =/„%), Gr(a) = n/*yi'^)a.d., = yj") d.. 

For (83), we need to show that G"(a) — > 0. By (64), we have 



Jo JO \Jn 



1 / rs \ 2 

cj^ I ds 

n{s) 



(86) 

<K/n'^E[ {cffds<Ka/n^. 
Jo 

Then by Cauchy's inequality, (64) and (86), 
E f\Yj''^fds<KE r {Xl^"''^fcfdrds 

Jo Jo Jn{s) 

1 rs ... /•! /-s \ 1/2 



<K(Ef r {Xl^^'^fdrdsE ( r {cffdrds 

\ Jo Jn.(s) Jo Jn(s) 

< K/n{E j\xl'^''^fdsE j\cf)Us 
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Since 1^^"'' is a martingale with mean zero, K[Z'''Z^) = for i ^ j. Again by 

ni/n 
ki-l)/r 



the Cauchy inequahty, {Z^f < f'\.,{Y^''^Yds/n. By (87), 



n „i 

i=i -^0 

It follows that G"(l) — > 0, which implies that G^{a) — > for a step func- 
tion a. In the general case there is a step fmiction b such that E /o^(as — 

^ 0. Since GH&s) and 



Ef sup \G'l{a)-G'l{h)\ \ <tm( [\Y^^\as-hs)\ds 
\o<t<i / \Jo 



<nlEJ {Vj'^^fdsEJ {as-hsfds\ 

/ A \l/2 

< K^/afE j {as - bsfdsj , 



G'i{a) — > for any function a. So we have (83). Equation (80) follows from 
(83), (63) and the following integration by parts formula: 



J n(s) Jn(s) 



Now let 7„ = supi<j<„/(y_^-^)^^(ar.)^rfr Since 7„ < a and 7„ 0, then 

E[7„] ^ and E[7„]2 ^ 0. By the Cauchy inequality, n(Ai"^)2 < 7^. By (86), 
we have the following: 

(a) E / (X]("))^ds< V^/n, 
Jo 

ft \ 2 



(b) ^(nj^ xI'-'^^'aIpUAs 

<E n{Xl^''^f dsE r n{A'f^^f{cf)^ds<a^E[jn], 



(c) E(^n^ 

<AcE^*n2(XiW)2(4"))2cf ds) 

< k(^E^' n2(Xi(«))^ ds) (kJ^ n2(4"))4(cf )2 ds^ 



< K;a\/E^. 
Thus, (81) and (82) follow. 
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Next, we will prove (84) and (85). Let = /^^) A^' dX}.. Since 
"^inmf < KTi^Ef r (4"¥c^^ drY < Kn^Efjn /' cj^ dr) 

\Jnis) J \ Jn{s) J 



by (64), we have 




from which we get (84) and (85), since E[7^] ^0. □ 
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